Multi-aspect Entity-Centric Analysis of Big Social Media Archives
Abstract
Social media archives serve as important historical information sources, so meaningful methods for analyzing and exploring them are of great value to historians, sociologists, and other interested parties. In this paper, we propose an entity-centric approach to analyzing social media archives, and we define measures for studying how entities are reflected in social media in different time periods and under different aspects (such as popularity, attitude, controversiality, and connectedness with other entities). A case study using a large Twitter archive spanning four years illustrates the insights that can be gained from such an entity-centric, multi-aspect analysis.
Similar Resources
Time-Aware Entity Linking
Entity Linking is the task of automatically identifying entity mentions in a piece of text and linking them to their corresponding entries in a reference knowledge base like Wikipedia. Although there is a plethora of works on entity linking, existing state-of-the-art approaches do not explicitly consider the time aspect and specifically the temporality of an entity’s prior probability (populari...
Big Data Methods for Computational Linguistics
Many tasks in computational linguistics traditionally rely on hand-crafted or curated resources like thesauri or word-sense-annotated corpora. The availability of big data, from the Web and other sources, has changed this situation. Harnessing these assets requires scalable methods for data and text analytics. This paper gives an overview on our recent work that utilizes big data methods for en...
Towards Multi-Screen Social TV with Geo-Aware Social Sense
The increasing popularity of social interactions and geo-tagged user generated contents (UGC) has been transforming the television (TV) viewing experience, from a “laid-back” video watching behavior into a “lean-forward” social-engaged experience. This article envisions a multi-screen social TV integrated with social sense via a second screen as a novel paradigm for content consumption. This ne...
TimeMachine: Entity-Centric Search and Visualization of News Archives
We present a dynamic web tool that allows interactive search and visualization of large news archives using an entity-centric approach. Users are able to search entities using keyword phrases expressing news stories or events and the system retrieves the most relevant entities to the user query based on automatically extracted and indexed entity profiles. From the computational journalism persp...
Bringing Structure to Text: Mining Phrases, Entity Concepts, Topics, and Hierarchies
Mining phrases, entity concepts, topics, and hierarchies from massive text corpus is an essential problem in the age of big data. Text data in electronic forms are ubiquitous, ranging from scientific articles to social networks, enterprise logs, news articles, social media and general web pages. It is highly desirable but challenging to bring structure to unstructured text data, uncover underly...